2023-08-25 11:19:29.AIbase.817
Aliyun Tongyi Qianwen Open Sources Again: Multimodal Large Model Qwen-VL
Qwen-VL is a large-scale vision language model launched by Aliyun that supports image and text input. Compared to other vision language models, Qwen-VL has added capabilities such as visual localization and image text understanding. Qwen-VL has received over 3,400 stars on GitHub and has been downloaded more than 400,000 times.